Rank | Count | Beginning |
---|---|---|
3898 | 5223 | De |
13490 | 2419 | Het |
17553 | 1017 | In |
16283 | 710 | Hij |
10043 | 707 | Dit |
11242 | 562 | Een |
9239 | 547 | Deze |
22869 | 502 | Op |
22378 | 481 | Ook |
12221 | 471 | Er |
27015 | 430 | Volgens |
1985 | 388 | Bij |
3476 | 388 | Dat |
21011 | 332 | Na |
20271 | 328 | Met |
27462 | 310 | Voor |
1071 | 295 | Als |
29312 | 250 | Zij |
19774 | 213 | Maar |
11965 | 203 | En |
29690 | 175 | Zo |
25405 | 170 | Tijdens |
10863 | 167 | Door |
29071 | 164 | Ze |
18016 | 156 | Indien |
28729 | 141 | Wij |
147 | 140 | 2. |
25920 | 140 | U |
26653 | 140 | Verder |
17323 | 136 | Ik |
In the next four subsections show the most frequent sentence beginnings consisting of N words, N=1, 2, 3, 4. In this subsection we start with N=1.
The most frequent word-N-grams at the beginning of sentences give some insight into sentence composition.
Especially for N=1, we only need a small corpus to identify the most frequent sentence beginnings.
select substring_index(sentence, ' ', 1) as beg, count(*) as cnt from sentences group by substring_index(sentence, ' ', 1) order by cnt desc limit 50;
4.3.1.2 Most Frequent Sentence Beginnings II
4.3.1.3 Most Frequent Sentence Beginnings III
4.3.1.4 Most Frequent Sentence Beginnings IV
4.3.1.1 Most Frequent Sentence Endings I
4.3.1.2 Most Frequent Sentence Endings II
4.3.1.3 Most Frequent Sentence Endings III
4.3.1.4 Most Frequent Sentence Endings IV